Tutorial on Lempel-Ziv Data Compression Algorithm
نویسنده
چکیده
In many scenario of digital communication and data processing, we may deal with strings of data which have certain structural regularities, making it possible for time-saving techniques of data compression. Given a discrete data source, the data compression problem is first to identify the limitations of the source, and second to devise a coding scheme which will best compress it subject to certain performance criteria. If the relevant source parameters have been identified, the problem reduces to one of minimumredundancy coding. When no a priori knowledge of the source characteristics is available, and if statistical estimation are either impossible or unreliable, one must resort to universal coding schemes whereby the coding process is interlaced with a learning process for the varying source characteristics. Therefore, such coding schemes inevitably require a larger working memory space. The Lempel-Ziv Data Compression Algorithm[1] (sometimes called LZ ’77) is one of universal coding scheme which can be applied to any discrete source and whose performance is comparable to certain optimal fixed code book schemes designed for completely specified sources. This compression algorithm is adapted from the concept of encoding future segments of the source-output by maximum-length copying from a buffer containing the recent past output. The transmitted codeword consists of the buffer address and the length of the copied segment. At the decoding end, the source data can readily be reconstructed with a predetermined initial load of the buffer and the information contained in the codewords. In Section II, this coding scheme will be described in detail. In Section III, we give bounds on the compression efficiency attainable with full a priori knowledge of the source by fixed code book schemes, and then show that the efficiency of LZ ’77 with no a priori knowledge of the source approaches those bounds. Then, in Section IV, we will provide some improvements on the original algorithm. Finally, in Section V, some applications of LZ ’77 will be demonstrated.
منابع مشابه
Towards Analytical Information Theory: Recent Results on Lempel-Ziv Data Compression Schemes
The Lempel-Ziv algorithms are well-known dynamic dictionary algorithms of use in data compression. The talk shows how analytic models originally developed for the analysis of tries and digital search trees may be used to characterize their compression characteristics.
متن کاملA Modified Lempel–ziv Welch Source Coding Algorithm for Efficient Data Compression
Lempel–Ziv Welch (LZW) algorithm is a well-known powerful data compression algorithm created by Abraham Lempel, Jacob Ziv, and Terry Welch. The algorithm is designed to be fast to implement but is not usually optimal because it performs only limited analysis of the data. A modified LZW algorithm on source coding will be proposed in this paper to improve the compression efficiency of the existin...
متن کاملEnhancement Of Lempel-Ziv Coding Using A Predictive Pre-Processor Scheme for Data Compression
In the implementation of lossless compression, the traditional Lempel-Ziv dictionary algorithm produces good compression performance, but at a relatively slow processing speed for use in real-time applications. This paper highlights a technique to speed up the overall compression process through introduction of a prediction pre-processor stage. A variety of predictors are applied to two variati...
متن کاملLempel-Ziv Dimension for Lempel-Ziv Compression
This paper describes the Lempel-Ziv dimension (Hausdorff like dimension inspired in the LZ78 parsing), its fundamental properties and relation with Hausdorff dimension. It is shown that in the case of individual infinite sequences, the Lempel-Ziv dimension matches with the asymptotical Lempel-Ziv compression ratio. This fact is used to describe results on Lempel-Ziv compression in terms of dime...
متن کاملDeenability and Compression
A compression algorithm takes a nite structure of a class K as input and produces a nite structure of a diierent class K' as output. Given a property P on the class K deened in a logic L, we study the deenability of property P on the class K'. We consider two compression schemas on unary ordered structures (words), a naive compression and the classical Lempel-Ziv. First-order properties of stri...
متن کامل